Consistency Properties of Species Tree Inference by Minimizing Deep Coalescences
Abstract
Methods for inferring species trees from sets of gene trees need to account for the possibility of discordance among the gene trees. Assuming that discordance is caused by incomplete lineage sorting, species tree estimates can be obtained by finding those species trees that minimize the number of "deep" coalescence events required for a given collection of gene trees. Efficient algorithms now exist for applying the minimizing-deep-coalescence (MDC) criterion, and simulation experiments have demonstrated its promising performance. However, simulation results have also indicated that the MDC criterion is not guaranteed to infer the correct species tree. In this article, we investigate the consistency of the MDC criterion. Using the multispecies coalescent model, we show that there are indeed anomaly zones for the MDC criterion for asymmetric four-taxon species tree topologies, and for all species tree topologies with five or more taxa.
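To make the criterion concrete, the following is a minimal sketch of how the deep coalescence cost can be counted, not the authors' algorithm or any of the efficient algorithms the abstract mentions. It assumes rooted trees written as nested tuples of species names, exactly one sampled gene copy per species, and a user-supplied list of candidate species trees that is scored exhaustively. All function names are hypothetical. The cost of a gene tree inside a species tree is taken as the number of "extra lineages": for each non-root species branch, the number of gene lineages leaving the top of that branch, minus one.

```python
def leaf_set(tree):
    """Return the set of leaf labels under a node (a leaf is a plain string)."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def count_lineages(gene_tree, cluster):
    """Count gene lineages leaving the top of the species branch whose
    descendant leaf set is `cluster`: the maximal gene-tree clades inside it."""
    if leaf_set(gene_tree) <= cluster:
        return 1  # whole clade has coalesced within this part of the species tree
    if isinstance(gene_tree, str):
        return 0  # leaf that belongs to a different part of the species tree
    return sum(count_lineages(child, cluster) for child in gene_tree)


def species_clusters(species_tree):
    """Leaf sets of every non-root node, one per species-tree branch."""
    found = []

    def walk(node, is_root):
        if not is_root:
            found.append(leaf_set(node))
        if not isinstance(node, str):
            for child in node:
                walk(child, False)

    walk(species_tree, True)
    return found


def extra_lineages(species_tree, gene_tree):
    """Deep coalescence cost of one gene tree within one species tree."""
    return sum(count_lineages(gene_tree, c) - 1
               for c in species_clusters(species_tree))


def mdc_score(species_tree, gene_trees):
    """Total cost of a species tree over a collection of gene trees."""
    return sum(extra_lineages(species_tree, g) for g in gene_trees)


def best_species_trees(candidates, gene_trees):
    """Return every candidate species tree that attains the minimum cost."""
    scores = [mdc_score(s, gene_trees) for s in candidates]
    best = min(scores)
    return [s for s, score in zip(candidates, scores) if score == best]


# Toy input: an asymmetric and a symmetric four-taxon topology.
caterpillar = ((("A", "B"), "C"), "D")
balanced = (("A", "B"), ("C", "D"))
gene_trees = [balanced, balanced, caterpillar]
chosen = best_species_trees([caterpillar, balanced], gene_trees)
```

On this toy input the symmetric candidate scores lower than the asymmetric one, so `chosen` contains only `balanced`. The example illustrates the scoring rule only; it makes no statement about gene tree probabilities under the multispecies coalescent.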
Similar resources
From gene trees to species trees II: Species tree inference in the deep coalescence model
When gene copies are sampled from various species, the resulting gene tree might disagree with the containing species tree. The primary causes of gene tree and species tree discord include lineage sorting, horizontal gene transfer, and gene duplication and loss. Each of these events yields a different parsimony criterion for inferring the (containing) species tree from gene trees. With lineage ...
Species Tree Inference by Minimizing Deep Coalescences
In a 1997 seminal paper, W. Maddison proposed minimizing deep coalescences, or MDC, as an optimization criterion for inferring the species tree from a set of incongruent gene trees, assuming the incongruence is exclusively due to lineage sorting. In a subsequent paper, Maddison and Knowles provided and implemented a search heuristic for optimizing the MDC criterion, given a set of gene trees. H...
Inference of Parsimonious Species Trees from Multi-locus Data by Minimizing Deep Coalescences
One approach for inferring a species tree from a given multi-locus data set entails computing a tree that optimizes a certain criterion. In 1997, W. Maddison proposed “minimizing deep coalescences”, or MDC, as one such criterion. This is a parsimonious criterion that, roughly speaking, seeks the tree that minimizes a quantity called extra lineages when all gene trees are reconciled within its b...
Consistency and inconsistency of consensus methods for inferring species trees from gene trees in the presence of ancestral population structure.
In the last few years, several statistically consistent consensus methods for species tree inference have been devised that are robust to the gene tree discordance caused by incomplete lineage sorting in unstructured ancestral populations. One source of gene tree discordance that has only recently been identified as a potential obstacle for phylogenetic inference is ancestral population structu...
The accuracy of species tree estimation under simulation: a comparison of methods.
Numerous simulation studies have investigated the accuracy of phylogenetic inference of gene trees under maximum parsimony, maximum likelihood, and Bayesian techniques. The relative accuracy of species tree inference methods under simulation has received less study. The number of analytical techniques available for inferring species trees is increasing rapidly, and in this paper, we compare the...
Journal:
- Journal of computational biology : a journal of computational molecular cell biology
Volume 18, Issue 1
Pages: -
Publication date: 2011